GSMFlow: Generation Shifts Mitigating Flow for Generalized Zero-Shot Learning

نویسندگان

چکیده

Generalized Zero-Shot Learning (GZSL) aims to recognize images not only for seen classes but also unseen ones by transferring semantic-visual relationships from the classes. It is an intuitive solution take advantage of generative models hallucinate realistic samples based on knowledge learned However, due generation shifts, synthesized most existing methods may drift real distribution data. To address this issue, we propose a novel flow-based framework that consists multiple conditional affine coupling layers learning data generation. Specifically, investigate and three essential problems trigger i.e., semantic inconsistency, variance collapse, structure disorder. First, improve reflection semantic information in generated samples, proactively embed into transformation each layer. Second, promote intrinsic feature variance classes, introduce boundary sample mining strategy with entropy maximization discover ambiguous visual variants prototypes hereby calibrate decision classifiers. Third, relative positioning proposed revise attribute embeddings, guiding which fully preserve inter-class geometric structure further avoid disorder space. Extensive experimental results four GZSL benchmark datasets demonstrate GSMFlow achieves state-of-the-art performance GZSL.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Unified approach for Conventional Zero-shot, Generalized Zero-shot and Few-shot Learning

Prevalent techniques in zero-shot learning do not generalize well to other related problem scenarios. Here, we present a unified approach for conventional zero-shot, generalized zero-shot and few-shot learning problems. Our approach is based on a novel Class Adapting Principal Directions (CAPD) concept that allows multiple embeddings of image features into a semantic space. Given an image, our ...

متن کامل

Generalized Zero-Shot Learning via Synthesized Examples

We present a generative framework for generalized zeroshot learning where the training and test classes are not necessarily disjoint. Built upon a variational autoencoder based architecture, consisting of a probabilistic encoder and a probabilistic conditional decoder, our model can generate novel exemplars from seen/unseen classes, given their respective class attributes. These exemplars can s...

متن کامل

Improving zero-shot learning by mitigating the hubness problem

The zero-shot paradigm exploits vector-based word representations extracted from text corpora with unsupervised methods to learn general mapping functions from other feature spaces onto word space, where the words associated to the nearest neighbours of the mapped vectors are used as their linguistic labels. We show that the neighbourhoods of the mapped elements are strongly polluted by hubs, v...

متن کامل

Ordinal Zero-Shot Learning

Zero-shot learning predicts new class even if no training data is available for that class. The solution to conventional zero-shot learning usually depends on side information such as attribute or text corpora. But these side information is not easy to obtain or use. Fortunately in many classification tasks, the class labels are ordered, and therefore closely related to each other. This paper d...

متن کامل

Zero-Shot Kernel Learning

In this paper, we address an open problem of zero-shot learning. Its principle is based on learning a mapping that associates feature vectors extracted from i.e. images and attribute vectors that describe objects and/or scenes of interest. In turns, this allows classifying unseen object classes and/or scenes by matching feature vectors via mapping to a newly defined attribute vector describing ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Multimedia

سال: 2022

ISSN: ['1520-9210', '1941-0077']

DOI: https://doi.org/10.1109/tmm.2022.3190678